On Subsampling Procedures for Support Vector Machines
Authors
Abstract
Herein, theoretical results are presented to provide insights into the effectiveness of subsampling methods in reducing the number of instances required in the training stage when applying support vector machines (SVMs) for classification in big data scenarios. Our main theorem states that, under some conditions, there exists, with high probability, a feasible solution to the SVM problem for a randomly chosen subsample, with the corresponding classifier as close as desired (in terms of error) to the classifier obtained from the complete dataset. The theorem also reflects the curse of dimensionality, in that the assumptions made are much more restrictive in large dimensions; thus, subsampling methods will perform better in lower dimensions. Additionally, we propose an importance sampling and bagging subsampling method that expands the nearest-neighbors ideas of previous work. On different benchmark examples, the method proposed herein is faster (without significant loss of accuracy) than the available state-of-the-art techniques.
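The central claim of the abstract, that a classifier trained on a random subsample can be close in error to the one trained on the complete dataset, can be illustrated with a small experiment. The following is only a minimal sketch under stated assumptions, not the authors' method: it uses plain uniform subsampling (not the importance sampling and bagging scheme), a Pegasos-style stochastic subgradient solver for a linear SVM without a bias term, and synthetic two-dimensional Gaussian data. The function names, the 5% subsample size, and all parameter values are hypothetical choices made for illustration.

```python
import random


def make_blobs(n, rng):
    """Synthetic data (assumption): two Gaussian classes centred at (+2, +2) and (-2, -2)."""
    data = []
    for _ in range(n):
        y = rng.choice((-1, 1))
        x = (rng.gauss(2.0 * y, 1.0), rng.gauss(2.0 * y, 1.0))
        data.append((x, y))
    return data


def train_linear_svm(data, rng, lam=0.01, steps=5000):
    """Pegasos-style stochastic subgradient descent on the regularised hinge loss.

    No bias term is fitted, which is adequate here only because the synthetic
    classes are symmetric about the origin.
    """
    w = [0.0, 0.0]
    for t in range(1, steps + 1):
        x, y = rng.choice(data)
        eta = 1.0 / (lam * t)                      # decaying step size
        margin = y * (w[0] * x[0] + w[1] * x[1])   # margin under the current w
        shrink = 1.0 - eta * lam                   # effect of the L2 regulariser
        w = [shrink * w[0], shrink * w[1]]
        if margin < 1.0:                           # hinge loss is active
            w = [w[0] + eta * y * x[0], w[1] + eta * y * x[1]]
    return w


def accuracy(w, data):
    """Fraction of points whose label matches the sign of the linear score."""
    hits = 0
    for x, y in data:
        pred = 1 if w[0] * x[0] + w[1] * x[1] >= 0 else -1
        hits += pred == y
    return hits / len(data)


rng = random.Random(0)
train = make_blobs(5000, rng)
test = make_blobs(2000, rng)
subsample = rng.sample(train, 250)   # uniform 5% subsample of the training set

w_full = train_linear_svm(train, rng)
w_sub = train_linear_svm(subsample, rng)
acc_full = accuracy(w_full, test)
acc_sub = accuracy(w_sub, test)
print(f"full data: {acc_full:.3f}  subsample: {acc_sub:.3f}")
```

Because both runs use the same number of stochastic steps, this sketch says nothing about training time; it only compares the test accuracy of the two classifiers on well-separated, low-dimensional data, the setting in which the abstract suggests subsampling should work best.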
Similar Resources
STAGE-DISCHARGE MODELING USING SUPPORT VECTOR MACHINES
Establishment of rating curves is often required by hydrologists for flow estimates in streams, rivers, etc. Measurement of discharge in a river is a time-consuming, expensive, and difficult process, and the conventional approach of regression analysis of the stage-discharge relation does not provide encouraging results, especially during floods. P
On Transductive Support Vector Machines
Transductive support vector machines (TSVM) have been widely used as a means of treating partially labeled data in semisupervised learning. They have remained something of a mystery because their foundation in generalization is poorly understood. This article aims to clarify several controversial aspects regarding TSVM. Two main results are established. First, TSVM performs no worse than its supervise...
On Universum-Support Vector Machines
The Universum-support vector machine (U-SVM) is an elegant method for the 2-class classification problem. It is systematically studied in this paper, including the existence and uniqueness of the primal problem as well as the relation between the solutions of the primal problem and the dual problem. We find that U-SVM uses a 3-class classification approach to solve the 2-class classification problem. So we have ...
On Margin and Support Vector Separability in Support Vector Machines for Regression
In this report we show some simple properties of SVM for regression. In particular we show that for ε close to zero, minimizing the norm of w is equivalent to maximizing the distance between the optimal approximating hyperplane solution of SVMR and the closest points in the data set. So, in this case, there exists a complete analogy between SVM for regression and classification, and the ε-tube plays...
Binarized Support Vector Machines
The widely used Support Vector Machine (SVM) method has been shown to yield very good results in Supervised Classification problems. Other methods such as Classification Trees have become more popular among practitioners than SVM thanks to their interpretability, which is an important issue in Data Mining. In this work, we propose an SVM-based method that automatically detects the most important pre...
Journal
Journal title: Mathematics
Year: 2022
ISSN: 2227-7390
DOI: https://doi.org/10.3390/math10203776